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Abstract. Shapley and Scarf [8] introduced a notion of stable allocation between traders 
" and indivisible goods, when each trader has rank-ordered each of the goods. The purpose 

of this note is to prove that the distribution of ranks after allocation is the same as the 
distribution of search distances in uniform hashing, when the rank-orderings are indepen- 



dent and uniformly random. Therefore the average sum of final ranks is just (n + l)H n — n, 
and the standard deviation is 0{n). The proof involves a family of interesting one-to-one 



o 



X 



correspondences between permutations of a special kind. 

0. Introduction. Suppose n traders have n indivisible goods to trade, and each trader k has 
ranked the goods of all traders (including himself) as a permutation 



Pk = Pkl Pk2 ■■■ Pkr 



u 
o 

of {1, 2, . . . , n}. If i precedes j in this list, we write "i > j (&)" and say that k prefers i to j. An 
allocation of goods to traders is a permutation g\ ■ ■ ■ g n of {1, . . . , n} such that trader k gets gk- 
Shapley and Scarf [8] defined what they called a "core allocation" g, which is stable in the following 
sense: If C is any coalition of traders (a nonempty subset), and if h is any allocation of the goods 
I/"") . of C to the members of C, then 

m 

(f) ' hk > gk (k) for all k E C ==> hk = gk for all k G C . (*) 

o ' 

. For example, suppose n = 3 and the preference rankings are 



pi = 2 1 3 ; 
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g: p 2 = 3i2. 

P3 = 231. 



Then just three of the six possible allocations satisfy (*) when C is the full set {1,2,3}, namely 
213, 2 31, and 13 2. But 213 is unstable because traders 2 and 3 can both improve their lot by 
swapping goods between themselves. Similarly, 2 3 1 is unstable, but it is not quite as bad: If the 
coalition {2, 3} exchanges goods, trader 3 is happier than he was before, while 2 is no worse off. 
The remaining allocation, 13 2, is stable. 

It is not immediately obvious that a stable allocation always exists, for all n\ n possible ranking 
sequences. But Shapley and Scarf presented an algorithm by David Gale that always finds one. 
In fact, there is always exactly one stable allocation. Gale's procedure is similar to the famous 
Gale-Shapley algorithm for stable marriage [2,4], but it incorporates a new twist. 

Alan Frieze and Boris Pittel [1] recently analyzed Gale's algorithm and discovered some re- 
markable simplifications in the course of their study. One of the main purposes of the present note 
is to exhibit some underlying combinatorial structure that accounts for the surprising phenomena 
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they discovered. Frieze and Pittel proved, among other things, that the total sum of ranks in 
the stable allocation g (i.e., the sum n + ■ ■ ■ + r n , where gt = Pkr k ) is between (| — e)nhm and 
(1 + e)n In n with high probability asm oo, assuming that the preferences pk are independent and 

uniformly random. We will deduce the exact distribution of r\ H hr ra , showing in particular that 

its mean value is (n + l)H n — n, and in fact we will see that the joint distribution of the multiset 
{ri, . . . , r n } has a particularly simple form. 

1. Uniform hashing. First let's consider a simpler problem, namely to find an allocation g that 
satisfies (*) when C is the full set {1, . . . , n} but not necessarily for any other C. Such an allocation 
is locally optimal, in the sense that no trader can improve his selection unless some other trader 
loses ground. It's easy to achieve such an allocation by simply letting g^ be the first item of list pk 
that is not in {g±, . . . ,gk-i}, for k = 1, . . . , re. 

This trivial allocation algorithm, "first-come first-served," is precisely the method of uniform 
hashing that arises in the study of information retrieval [5] , when the preference lists (called "hash 
sequences" in that context) are randomly chosen. The analysis of uniform hashing is particularly 
simple, and we will see below that stable allocation can be reduced to the same analysis. 

A more general way to obtain a locally optimal allocation is to let ir be any permutation of 
{l,...,n}, and then to let g w (k) be the first item of p^^) that is not in {g n (i) • • • ^(fe-i)}) f° r 
k = 1, . . . ,n. The permutation ir gives top priority to trader 7r(l), then to ir(2), and so on. Indeed, 
every allocation that satisfies (*) for C = {l,...,n} will be found by this method, for some ir. 
The reason is that we must have gt = Pki f° r some k, in any locally optimum g, because the 
mapping from k to p\-\ for all k always contains a cycle; any such cycle can be used to improve 
an allocation in which no trader has his first choice. Let 7r(l) be any value of k with gk = Pki- 
Remove k from all preference lists and apply the same reasoning recursively to the remaining n — 1 
traders. This defines a permutation ir(l) n(2) . . . 7r(n) such that g^{k) is the favorite of trader ir(k) 
in {1, . . . , re} \ {g n{1) , . . . , fl^fc-i)}. 

Since every locally optimum allocation is obtained in this way using some ir, the stable alloca- 
tion must itself result from some ir. And when the preference lists are random, any ir behaves like 
any other. Thus we might expect that stable allocation statistics are essentially identical to the 
statistics of uniform hashing. This, in fact, is true, but we must be careful to make the argument 
rigorous. 

2. An algorithm. Let's now consider a simple algorithm that computes the stable allocation, 
given any sequence of preference rankings p\ . . .p n . The following procedure is a sequential variant 
of Gale's parallel method, analogous to the McVitie- Wilson version [7] of Gale and Shapley's original 
stable marriage algorithm. The basic idea is to look for cycles among the traders' best choices, and 
to put such cycles into the allocation whenever they are found. 

Al. [Initialize.] Set (q 1} . . . , q n ) <- (-1, . . . , -1), (n, . . . , r n ) <- (0, . . . ,0), (g 1} . . . ,g n ) <- 
(0, . . . ,0), and t <— 0. (During this algorithm, t will be a trader who makes proposals 
to other traders, or zero when a new trader needs to enter the picture. Variable qk will 
be the number of a trader who currently wants trader fc's goods, or q^ = if trader k 
has expressed interest in somebody else's wares but nobody has reciprocated; qk = — 1 if 
trader k has not yet entered. Variable is the position of trader k in his list, the number 
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of proposals he has made. Variable gk is trader /c's allocation, or if no allocation has yet 
been made.) 

A2. [Introduce a new trader.] (At this point t = 0, and = iff = —1. The traders 
with <7fc > have been assigned a permutation of their goods.) If all are nonzero, the 
algorithm terminates. Otherwise, set t to some k with g^ = 0, and set q t <— 0. 

A3. [Propose.] Increase r< by 1, then set s = ptr t - If g.s > 0, repeat this step. (Trader t 
has expressed interest in his best remaining choice, s.) 

A4. [Is s spoken for?] If q s > 0, go to step A5. Otherwise set q s <— t and t <— s, then 
return to A3. 

A5. [Remove a cycle.] (There is now a cycle s = s\ — ► s 2 — ► • • • — ► s m = s, where Sj+i 
is the best remaining choice of sj. This cycle must be part of any stable allocation, so 
we incorporate it into g.) Set t <— q s ; then repeatedly set g s <— p srs and s <— p sr . s until 
finding g s > 0. If t = 0, return to A2, otherwise go to A3. □ 

In step A3 there is always a path 

= to — > t\ — > t 2 — ^ • • • — * i m = * 

connecting all traders such that gk = and gfc > 0. Trader ii entered in step A2, and ij+i is the 
best remaining choice of tj, for 1 < j < m\ also q tj = for 1 < j < m. These invariant relations 
justify the parenthesized assertions within the algorithm. 

The final allocation gi . . . g n is stable. For if hk < gk (k) for all hk in some coalition C, and if 
some hk < gk (k), then hk = Pkr for some r < r^, so hk was rejected by the algorithm. When the 
algorithm changed from r to r + 1, it had already found hk to be the best remaining choice of 
some other trader s, and it had assigned g s = hk- Therefore h s > g s (s). 

Moreover, the stable allocation is unique. If g is assigned differently on any cycle that leads to 
step A5, that cycle will be a coalition violating (2). 

3. A constructive lemma. We have observed that the stable allocation will be found by a 
first-come-first-served algorithm equivalent to uniform hashing, using at least one permutation it 
to give priority to the traders. For example, in the introduction we considered a case where n = 3 
and the stable allocation was 13 2. Any it in which trader 3 has priority over trader 1 will find this 
allocation. 

We can also consider permutations of the preference lists. Let a = cr(l) . . . <r(n) be a permuta- 
tion of {1, . . . , n}, and suppose that trader u{k) uses list P^fc) = P k ~ This will permute the locally 
optimum allocations, and it may also change the stable allocation. For example, if a is 2 1 3, so 
that 

p' 2 = 213, 
Pi = 312, 
Ps = 231, 

the stable allocation becomes <7 2 5i53 = 213, because g' 2 must be 2 and then g' 3 must be 3. Allocating 
goods in the order 213 was the worst of the locally stable alternatives when a was the identity 
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permutation 12 3, but it is best in the modified problem. Shuffling the preference lists corresponds 
to shuffling the goods that the traders started with. 

We are now ready to prove a key fact about stable allocation. Let us say that the prioriti- 
zation 7T is consistent with the shuffling a, with respect to preferences p = p\ . . . p n , if the locally 
optimum allocation gi ... g n obtained by uniform hashing with priorities ir is the stable allocation 
g^Ci.) ■ ■ -9'a(n) wnen Pa(k) = Pk- For example, 7r(l) 7r(2) 7r(3) =132 produces the locally optimum 
2 1 3, so ir is consistent with the shuffling <r(l) a(2) <r(3) =213 just considered. 

Lemma. Let p be any sequence of preference lists. There is a one-to-one correspondence between 
all permutations ir of {l,...,n} and all permutations a such that, if ir corresponds to a, the 
prioritization ir is consistent with the shuffling a. 

Proof. Given p and ir, suppose uniform hashing with priorities tt produces the locally optimum 
allocation g\...g n . Write the preference lists in rows, with each circled in its list pf.. Delete all 
elements to the right of 5% . 

We will construct a shuffling a whose stable allocation agrees with g. The construction involves 
two dynamically growing sets X and Y, whose significance will become clear momentarily. Initially 
X <— and Y is the set of all k where gk = Pki (i-e., all row numbers in which the circled element is 
all by itself). Set m <— 0; as the construction proceeds, we will have defined a(ir{Vj) , . . . ,a(ir{m)) 
as a permutation of {5^(1) , • • • , flVfm) }> an d we will have X C Y , ir(m + 1) € Y \ X. 

Find the minimum k in m < k < n such that either k = n or (ir(k + 1) G Y\X and ir(k + 1) > 
7r(m + 1)) or n(k + 1) Y. Define a(ir(j)) = (^(j+i) for m < j < k and <j(ir(k)) = g w ( m +i)- If 
k = n, the construction is complete. Otherwise, remove g n ( m +i), • • • -,9-w(k) from all preference lists 
where they aren't circled. If Tr(k + 1) £Y, set X <— Y and let Y be the set of all rows whose first 
elements are now circled. (Since it defines g by uniform hashing, ir(k + 1) will be in the new Y .) 
Set m <— k and repeat the instructions of this paragraph. 

A worked example will help clarify this construction. Let 

vr(l)...7r(9) = 534918276. 

Table 1 shows a sequence of preference lists for n = 9 in which it defines the locally optimum 
allocation indicated by circled elements. All elements to the right of the circled ones have been 
erased, since they are irrelevant for our present purposes. 

1: ® 4: 1© 7: 5 3® 
2: (D 5: ® 8: 9© 
3: ® 6: © 9: (2) 

Table 1. Preference lists and their stable allocation 
The stable allocation determined by these preference lists happens to coincide with the circled 
elements in Table 1, so in this case the priorities ir produce the stable allocation; but our con- 
struction works for any ir, whether or not its locally optimum allocation is stable. Initially m = 0, 
X = 0, and Y = {1,2,3,5,6,9}. According to the rules stated, we proceed to set k = 2, since 
7r(3) Y. So we define <r(5) = 53 = 1 and cr(3) = 9; then we delete 1 and 9 from lists 4 and 8, and 
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we set X «- {1, 2, 3, 5, 6, 9}, Y «- {1, 2, 3, 4, 5, 6, 8, 9}, m <- 2. Next, fc = 5 since vr(6) = 8 G Y \ X 
and 8 > 4 = vr(3). This time cr(4) = 2, cr(9) = 3, cr(l) = 5. We delete 2, 3, and 5 where they 
are not circled. After setting ra<-5we have k = 7, because 7r(8) g" Y. (Notice that 7 Y, even 
though row 7 now contains only its circled element. The construction changes Y only in the case 
ir(k + 1) g Y.) This time cr(8) = 4, <r(2) = 7, X <- {1, 2, 3, 4, 5, 6, 8, 9}, Y <-{l,..., 9}, m «- 7. 
On the final round we set cr(7) = 6 and cr(6) = 8; the shuffled preference lists are shown in Table 2. 
It is easy to verify that their stable allocation matches that of Table 1, using the algorithm given 
6ctrli6r 

5: © 2: 1© 6: 5 3® 

7: (D 1: ® 4: 9© 

9: © 8: © 3: © 

Table 2. Shuffled precedence lists having the same stable allocation as Table 1. 
The inverse construction is analogous. If a is any shuffling, circle its stable allocation and 
prepare an array like Table 2. Begin with m <— 0, X = 0, and Y as before. Then repeatedly 
consider all cycles 

cr(ai) <- cr(a 2 ) < <- <r(a t ) <- <j(oi) 

formed by elements {ai, . . . ,a t } C Y \ {7r(l), . . . ,7r(m)}. Here a(a) <— a(6) means that a (a) is 
the (circled) element in list a(b); for example, er(4) <— <r(9) in Table 2 because the circled element 
in list cr(9) = 3 is 2 = cr(4). The properties of stable allocation guarantee that at least one such 
cycle exists, and our inverse construction will guarantee that each cycle will contain at least one 
ai X. Call the largest such a, the cycle leader, and renumber the subscripts so that a\ is the cycle 
leader. Take the cycle with smallest leader, and set ir(m + 1) <— a±, . . . ,ir(m + t) <— at- Remove 
<r(ai), . . . , cr(at) from the tableau in places where they are not circled. Then set m <— m + t and 
repeat the same process until all cycles have been recorded in w. Then set X <— Y and let Y be the 
row numbers that now have but a single element. Repetition of these steps will produce a priority 
permutation ir consistent with a. 

It is not difficult to verify that these constructions invert each other. The reader will find 
easily, for example, that the permutation <j(l) . . . <r(9) = 579218643 in Table 2 leads back to 
vr(l)...vr(9) = 534918276. □ 

4. A theorem. The lemma we have just proved makes it easy to establish the main result of this 
note. We say that uniform hashing on pi . . .p n with priorities ir produces ranks r\ . . .r n if is 
minimum such that 

€ {]MlK (1) , • • • ,Pi7(k-l)r^ k - 1) } 

for 1 < k < n. 

Theorem. When preference lists p 1 . . .p n are independent and uniformly random, the probability 
that the stable allocation g\ . . . g n = p\ ri . . . p nrn has a given value of the (unordered ) multiset 
{ri, . . . , r n } is the same as the probability that uniform hashing yields {r\,...,r n }. 

Proof. Let {r\, . . . ,r n } be any given multiset. If p = p\ . . .p n is any sequence of preferences and a 
is any permutation of {1, ... ,n}, let s(a,p) = 1 if {r 1 , . . . , r n } is the multiset of ranks in the stable 
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allocation when trader a(k) has preference list pk] otherwise s(a,p) = 0. Then the probability that 
stable allocation on random preferences has ranks {ri,...,r n } is 

E s ^p) 

p 

for any fixed a. 

Similarly, if tt is any permutation of {1, ... , n}, let h(n,p) = 1 iff {r±, . . . , r n } is the multiset of 
ranks produced by uniform hashing with priorities tt. Then the probability that uniform hashing 
on random preferences has ranks {ri, . . . ,r n } is 

E h ^p) 

v 

for any fixed tt. 

We want to show that these sums are equal. This is now obvious, because the lemma implies 

that 

E E 5 ^) = E E s ^'^ = E E^p) = E E^^^) 

a p pa p 7r n p 

and we simply divide by n\ n+l . Q 

Notice that this proof of the theorem remains valid even when the preference lists pi...p n are 
not uniformly random. All we are assuming is a symmetry condition, that shuffled preference lists 
Po-(i) • • -Pcr(n) have the same distribution for all a. 

5. Corollaries. The analysis of uniform hashing is quite simple, so our theorem immediately 
characterizes many properties of the ranks in random stable allocations. For example, let us find 
the expected value of 

(z + ri)(z + r 2 )...{z + r n ) ; 

this polynomial is clearly a function of the multiset {r\ , . . . , r n }, so we can analyze it by considering 
its behavior with respect to uniform hashing. 

Let qkj be the probability that > j in uniform hashing. This is the probability that 

Pkl, ■ ■ ■ ,Pkj G {Pin, • • • ,P(fc-l)r fc _i}, so 

Standard binomial coefficient summation techniques [3] show that 

jr^\m) qk3 n + m + 2-k \ m J / \ m J ' U 
The expected value of (z + ri) . . . (z + r n ) is therefore 

n 

E II( 9fe (^-i) -Qkr k ){z + r k ) 

n,...,r n k=l 
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fc = l r=l 



n *+e 



<Zfcj 



fc=i 



i=o 



fc A = i V n + 2-k) (n + l)!j} = V 



(3) 



In particular, the expected value of ri + h r n , which is the coefficient of z n , is 



E 

it=i 



n + 1 

n + 2- fc 



(n+l)(fl n+ i-l) = (n + l)fl n -n. 



(4) 



The other coefficients can be expressed in terms of Stirling cycle numbers if we note that 

n+1 n+1 



E(z + n) . . . (z + r n )(z + n+ 1) = * JJ (fcz + n + 1) = 

fc=l fc=0 

For example, the coefficient of z n ~ 2 in ^(z + n) . . . (z + r n ) is 



n + 2 



(n + l) 2 
3 J (n + 1)! 



(n + 1) 



n + 2 
2 



(n+1) 
(n + 1)! 



+ (n+l) 2 



n + 2 
k + 1 



n + 2 



(n+ l) fc 
(n + 1)! 



,71+1 -fe 



(5) 



1 J (n + 1)! 



= (n + l) 2 (i (H 2 n+1 - H^) - H n+1 + l) 
= ^-(H 2 n -H^)-n(n+l)(H n -l)- 



(6) 



see [3, exercise 6.33]. 

So far we have used only the case m = of (2). A similar argument, using m = 1, shows that 



2x , 2x A f (n + l)(n + l + fc) 



(7) 



In particular, 



E(r 2 | r 2) = V (n+l)(n + l + fc) 

11 nJ ^ (n + 2- A;)(n + 3- fc) 

fc=i 



n , 

(n + l)E - 

k=i v 



2n + 4 



(n + 2 — k){n + 3 — fc) n + 2 -A;, 
= (n + l)(n - # n+1 + 1) = (n + l)(n - #„) + n . 

Hence, by (6) and (8), 

E(( ri + ■■■ + r n ) 2 ) = E(r\ + • • • + r 2 ) + 2 [z"" 2 ] £(z + n) . . . (z + r n ) 

= (n + 1) 2 (F 2 - #i 2) ) - (n + l)(2n + l)tf n + n(3n + 4) . 
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(8) 



(9) 



The expected value of the variance of the ranks is therefore 

E (l+^_ E ^ ri± _^ =n+0(logn)2 (w) 

while the variance of the rank sum is 

E{(n ■■■+ r n ) 2 ) - (E(n + ■■■ + r n )f = 2n(n + 2) - (n + l) 2 H^ - (n + l)H n 

= ^2 - y ^ n 2 + 0(n log n) . (11) 

The final rank r n in uniform hashing is uniformly distributed in {l,...,n}. Therefore the 
probability is > \ that at least one trader in a random stable allocation will have rank > \n. (He 
will be left with a piece of goods he doesn't like very much, since it's in the lower half of his list.) 
Indeed, the probability that max(ri, . . . , r n ) < \n is exactly 

(1 - <7im)(l - 92m) ... (1 - q nm ) , 

where m = [^n\; this is asymptotically 

(1 - - |)(1 - |)(1 - . . . « .288788 . (12) 



6. Conclusions and conjectures. The running time of the simple algorithm we have presented 
for stable allocation is essentially proportional to the sum of ranks in the unique allocation, r\ + 
■ ■ ■ + r n . We have proved that the statistical properties of any symmetric function of (ri . . . r n ) are 
identical to the corresponding statistics for uniform hashing, provided only that the distribution of 
preference lists p\ . . . p n is invariant under shuffling. When the preferences are uniformly random, 
the expected value of r\ + ■ ■ ■ + r n is exactly (n + l)H n — n, and the standard deviation is 0(n). 

Uniform hashing is equivalent to the classical stable marriage problem when all the girls have 
the same preference list. (See [6, pages 65-67].) Perhaps it is worthwhile to repeat here the main 
research problem about stable marriages that was advertised in the author's lectures of 1975 [6] and 
not yet resolved: If the girls have any fixed set of preferences and the boys propose at random, is 
the expected rank sum r\ + • • • + r n of the male-optimum stable marriage always > (n + l)H n — n? 
In other words, does the case of equal preferences for the girls (uniform hashing) give the greatest 
lower bound for E{r\ + • • • + r )? If so, the average would be tightly bounded, because the upper 
bound (n — l)H n + 1 is easy to prove [6, page 43]. 

In fact, computer experiments for small n suggest that the maximum value of E(r\ + ■ ■ ■ + r n ), 
when the girls have a fixed set of preferences and the boys propose independently at random, is 
obtained if and only if the girls' preferences are cyclic, in the sense that we could rename boys and 
girls so that girl j's kth. choice is congruent to j + k (mod n). 

Both conjectures about min and max E{r\ + • • • + r n ) have been verified by exhaustive enu- 
meration when n < 4, and in several hundred random experiments when n = 5. Presumably there 
is a (simple?) way to prove that, in some sense, the more the girls agree in their ranking, the less 
the men will have to propose, on the average. 
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Is there a simple expression for E(r\ + • • • + r n ) when the girls' preferences are cyclic? For 
n = 3,4,5 the values are respectively 306/3! 3 , 884224/4I 4 , 104035560000/5! 5 . When n = 4, the 
worst seven preference matrices for the girls are 



1234 


1234 


1234 


1234 


1234 


1234 


1234 


2341 


2314 


2314 


2341 


2314 


2314 


1234 


3412 


3412 


3421 


3142 


3412 


3412 


3412 


4123 


4123 


4123 


4123 


4132 


4213 


4231 



with respective total rank sums 

884224, 879488, 875264, 875072, 874752, 874624, 872192. 

All preference matrices not isomorphic to these seven, under renumbering of boys and girls, have 
smaller total rank sum over all 4! 4 preference matrices for the boys. 

Acknowledgment. I want to thank Boris Pittel for introducing me to this problem and for 
patiently correcting my original misunderstanding of the definitions. 
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